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Abstract 

The problem of determining which permutations can be sorted using 
certain switchyard networks is a venerable problem in computer science 
dating back to Knuth in 1968. In this work, we are interested in permuta- 
tions which are sortable on a double-ended queue (called a deque), or on 
two parallel stacks. In 1982, Rosenstiehl and Tarjan presented an O (n) 
algorithm for testing whether a given permutation was sortable on paral- 
lel stacks. In the same paper, they also presented a modification giving 
O (n) test for sortability on a deque. We demonstrate a slight error in the 
version of their algorithm for testing deque sortability, and present a fix 
for this problem. 

The general enumeration problem for both of these classes of permu- 
tations remains unsolved. What is known is that the growth rate of both 
classes is approximately O (8 n ), so computing the number of sortable per- 
mutations of length n, even for small values of n, is difficult to do using 
any method that must evaluate each sortable permutation individually. As 
far as we know, the number of deque sortable permutations was known 
only up to n = 14. This was computed using algorithms which effectively 
generate all sortable permutations. By using the symmetries inherent in 
the execution of Tarjan's algorithm, we have developed a new dynamic 
programming algorithm which can count the number of sortable permu- 
tations in both classes in O (n 5 2 n ) time, allowing the calculation of the 
number of deque and parallel stack sortable permutation for much higher 
values of n than was previously possible. 

Finally, we have examined the problem of trying to sort a permutation 
on a deque when the input elements are only revealed at the time when 
they are pushed to the deque. (Instead of having an omniscient view of the 
input permutation, this corresponds to encoding the input permutation as 
a deck of cards which must be drawn and pushed onto the deque without 
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looking at the remaining cards in the deck.) We show that there are some 
sortable permutations which cannot necessarily be sorted correctly on a 
deque using only this imperfect information. 



1 Introduction 

In 1968, Donald Knuth first posed the question "which permutations can 
be sorted on certain simple data structures such as stacks and queues"? 
[5] More generally, these simple data structure are instances of switch- 
yard networks which take their name by analogy to railroad switchyards. 
Switchyard networks consist of sets of two-way railroad tracks which serve 
as linear storage elements, along with one-way railroad track serving as 
operations for moving the end element from one storage element to an- 
other. Here is a sampling of well known switchyard networks. 
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Figure 1: Some small switchyards 



In the sorting problem posed by Knuth, a permutation n initially sits 
in the input section of the switchyard. Another section of two-way track 
is labeled as the output. The problem, then, is to determine whether or 
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not the elements of n can be moved to the output in sorted order, using 
the operations corresponding to the one-way sections of track. If such a 
sequence of operations exists, we say that the permutation n can be sorted 
by the given network. 

Some networks are very restrictive in the set of permutations which 
they can sort. For example, the only permutation which can be sorted on 
a single queue is the identity permutation itself. Alternatively, it is also 
possible to construct switchyard networks which are capable of sorting 
arbitrary input permutations. Between these two extremes, however, is a 
rich variety of sorting capabilities. 

In considering the question of which permutations could be computed 
on certain switchyard networks (where a permutation is computable on 
a network Af if and only if its inverse is sortable on A/") , Vaughan Pratt 
showed that any switchyard network A/" capable of sorting a permutation 
7r must also be capable of sorting every permutation contained in n, where 
containment is defined in the following way. 6 

def 

= A permutation tt £ S n contains the permutation a g Sk 
if and only if a can be recovered from 7r by removing a (possi- 
bly empty) subset of its elements, and then reducing the values 
of the remaining elements as necessary to remove any gaps (so 
that they consist of exactly the set {1,2,..., k}). If ir does not 
contain a, we say that n avoids a . 

This permutation containment relation is sometimes denoted a < n, 
and it is easy to see that it creates a poset on the set of all permutations. 
Pratt showed that the set of permutations which are sortable on a given 
switchyard network, viewed as a subset of the poset of all permutations, 
is closed under downward containment. Sets of permutations having this 
property have since become a major research area, and have been given 
their own title. 

def 

= A permutation class , C, is a set of permutations such 
that a G C whenever a X tt and n 6 C. 

Another way of defining a permutation class (also dating back to 
Pratt), is to consider the set of minimal permutations not contained in 
that class. Such a set, called the basis of C and denoted Bas(C), can be 
used to determine whether a given permutation is contained in the class 
C. If tv contains some element of Bas(C), then it cannot be in C, since 
that would imply that every permutation contained in tt must also be in C 
by the definition of a permutation class. Conversely, if n doesn't contain 
any element of the basis then it must be in C, since otherwise either it or 
some permutation contained in it must be a minimal permutation not in 
C. 

The basis of a permutation class is clearly an antichain in the poset of 
all permutations. Furthermore, by considering basis with infinite size, any 
permutation class can be described by the set of basis permutations which 
it avoids. This description of permutation classes as sets of permutations 
avoiding certain sets of basis permutations is now the standard represen- 
tation. We notate such a class as C = Av (Bas (C)), and the permutations 
of length n in C by C n = Av„ (Bas (C)). 
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Example. Consider the permutation class which consists of permutations 
sortable on a single stack, C. Knuth showed that a permutation is sortablc 
on a single stack if and only if it avoids the pattern 231. Therefore C = 
Av(231). 

In this work, we are interested in two specific permutation classes. 
The class of permutations which are sortable on two stacks in parallel, 
C, and the class of permutations which are sortable on a double-ended 
queue (also called a deque), T>. These are two of the classes which Pratt 
investigated in his 1973 paper, and he was able to find the basis of both 
of these classes. In each case, the basis is an infinite set which can be 
described by the pattern used to construct basis elements of each length. 

The basis for the class of parallel stack sortable permutations, C, con- 
sists of permutations having length greater than 3 and equivalent to or 
3 modulo 4, which fall in the following pattern 

2 3 4 1 

5 2 7 4 16 3 

2 7 4 16 3 8 5 

92 11 416385 10 7 

2 11 4 1 6 3 8 5 10 7 12 9 



Similarly, the basis of the class of deque sortable permutations, T>, 
consist of four permutations of each odd length greater than 4. One 
representative of each set of four falls in the following pattern 

5 2 3 4 1 

5 2 7 4 16 3 

927416385 

92 11 416385 10 7 

13 2 11 4 1 6 3 8 5 10 7 12 9 

13 2 15 4 1 6 3 8 5 10 7 12 9 14 11 



The other three basis patterns of each length can be recovered by some 
combination of interchanging the first two elements of the permutation, 
and interchanging the largest two elements of the permutation. 

Notice that every odd length pattern from Bas (C) is represented in 
Bas('D). This should not surprise us, since sortability on parallel stacks 
and on a deque are closely linked concepts. In fact, we can view a deque 
switchyard network as being just a parallel stack switchyard network in 
which the bottoms of the two stacks have been joined together to form a 
single linear storage element. Clearly, the permutations which are sortable 
on a deque are a superset of of those sortable on parallel stacks. 

When we set about the investigation leading to this work, our interest 
was primarily in the permutation class T>, the permutations which are 
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sortable on a deque. However, we address C as well, since most of our 
results for T> contain simplifications which pertain to C. 

2 The Enumeration Problem 

One question can be asked about a given permutation class is, "how many 
permutations of length n are in the class" ? Even though Pratt provided 
a full desription of the permutation classes C and T> by giving their basis 
patterns, such a description says almost nothing about the number of 
permutations in the classes of various sizes. 

When presented with the task of enumerating a sequence, such as the 
number of permutations in a given permutation class having length n for 
n — 1, 2, 3, . . ., there are several different forms that an answer can take. 

The most satisfying answer would be an explicit closed form formula 
as a function of n. For example, it has been shown that the number of 
permutations of length n which can be sorted on a single stack is the nth 
Catalan number. (That is, |Avrj (231) | = C„.) 

Another desirable answer is a generating function whose coefficients 
count the desired sequence. Generating functions have been found for 
several permutation classes for which closed form formulas are not known. 

Without a closed formula or a generating function, one is left with 
asymptotic analysis for an inexact view of the long term behavior of the 
sequence, and with algorithms for calculating the nth term for an exact 
view of a limited number of terms at the beginning of the sequence. 

The problem of enumerating the sequences \C\ \ , |C2 1 , . . . and \T>i | , \T>2 \ , • • • 
with a closed form solution or a generating function has gone unsolved for 
40 years. [5] Much of the work that has been done in the enumeration of 
these two classes has been devoted to studying their asymptotic behavior. 

We know that every permutation class having a nonempty basis has a 
growth rate that is at most exponential. Furthermore, every permutation 
class, B, describing permutations which are sortable on some switchyard 
network is supermultiplicative (the number of sortable permutations of 
length m + n is greater than or equal to the product of the number of 
sortable permutations of length m and the number of sortable permuta- 
tions of length n), which implies that the limit 

lim \B n \ " 

71— yoo 

is well defined. This limit is know as the growth rate of the permutation 
class and is denoted gr(23). The sequence enumerating the number of 
in-class permutations of each length then grows like (gr (B)) n . 

Neither gr (C) nor gr (T>) is known exactly, but the best known bounds, 
found by Albert, Atkinson, and Linton in 2009, give a very good estimate 
of what these growth rates must be: 





lower bound 


upper bound 




gr(C) 


7.535 


8.3461 




gr(D) 


7.890 


8.352 
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Notice that, asymptotically, the number of permutations which are 
sortable on parallel stacks must be very close to the number of permuta- 
tions which are sortable on a deque. In fact, Albert et al. have conjectured 
that the growth rates of these permutations may be equal. 

In contrast to the investigation of the growth rates of these permuta- 
tion classes, it seems that comparatively less work has been done on the 
problem of developing algorithms to calculate the terms of the sequence 
explicitly. The first twelve terms of \T>i \ , \T>2\ , • ■ ■ were known to Flajo- 
let, Salvy, and Zimmermann in 1989 [3j. In April of 2012, Zimmermann 
posted the first fourteen terms of this sequence on the online encyclopedia 
of integer sequences (http://oeis.org/A182216), along with a C program 
designed to compute these terms. Zimmermann's program works by con- 
structing words out of the alphabet of operations available to the deque 
switchyard (the alphabet {a, b, y, z}) and determining which permutations 
are sorted by these words. Zimmermann uses some relations in order to 
avoid enumerating all 16™ possible words of length 2n, but even so, this 
approach has an exponential runtime whose base is strictly greater than 
gr(2>). 

The problem of enumerating the sequence \Ci \ , \d \ , . . . is even less well 
known, and to the best of our knowledge, there are no known algorithms 
for computing it in less than u> (gr (C)). We do not know how many terms 
of this sequence are currently known. 

In this work, we will provide two new algorithms for computing the 
leading terms of these sequences. The first, which employs a parallel 
stack/deque sortability testing algorithm by Rosenstiehl and Tarjan has 
a runtime of (n 2 X n ) where X is equal to the growth rate of the rele- 
vant class. Modulo the sub-exponential factor n 2 , this is optimal among 
algorithms which must consider each sortable permutation. By harness- 
ing symmetries inherent in the execution of the Rosenstiehl-Tarjan algo- 
rithm, however, we have developed a second algorithm with a runtime of 
O (n 5 2 n ) . The next several sections of this work are devoted to discussion 
of these algorithms. 

3 The Rosenstiehl-Tarjan Algorithm for 
Parallel Stacks 

Suppose we are given some permutation n, and we wish to determine 
whether it belongs to the permutation class C. We call this the mem- 
bership testing problem. In 1982, Rosenstiehl and Tarjan presented an 
algorithm which can answer this question in linear time (O (n) where 
n is the length of n). 7_ (This runtime is optimal among approaches 
which must read a constant fraction of the permutation tv to determine 
its membership.) Rosenstiehl and Tarjan's algorithm works by using a 
data structure, which they call a pile of twinstacks, which simultaneously 
records all possible configurations of the parallel stack switchyard network 
throughout the process of trying to sort ir. Since we make extensive use 
of Rosenstiehl and Tarjan's algorithm, we present it here in its entirety. 
We begin with a definition of the fundamental data-structure unit used 
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by the algorithm. 



= Let a twinstack , [L, R] , be a pair of stacks, called the left 
stack and the right stack, each of which contains permutation 
elements in strictly increasing order from top to bottom. A 
proper twinstack must always have at least one of its stack 
nonempty. 

The Rosenstiehl and Tarjan algorithm represents the current state of 
two parallel stacks by a stack of twinstack, which Rosenstiehl and Tarjan 
call a pile of twinstacks. Each twinstack in the pile can be subject to 
several operations. A reversal swaps the left and right stacks. A weld 
combines the top two twinstacks into a single twinstack by concatenating 
their left stacks to form the new left stack, and by concatenating their 
right stacks to form a new right stack. 

Clearly, we cannot allow welding in cases where a larger element would 
be concatenated on top of a stack containing a smaller element (since this 
violates our definition of a twinstack). In fact, in general we would like 
to maintain the even stronger condition that each element contained in 
a given twinstack is smaller than every element in every twinstack below 
that twinstack. A pile of twinstacks for which this property holds is called 
normal . 

The intuition behind the Rosenstiehl- Tarjan algorithm is that each 
twinstack represents a degree of freedom in the positioning of the elements 
among the two parallel stacks. A normal pile of k twinstacks represents 
2 k different configurations of parallel stacks. These can be recovered by 
choosing one of 2 k different subsets of the twinstacks in the pile and 
reversing them, and then welding down the entire pile. (By welding down 
the pile, we mean applying successive weld operations to the top pair of 
twinstacks on the pile until only a single twinstack remains.) 

The Rosenstiehl-Tarjan algorithm processes the elements of the permu- 
tation in order, maintaining a normal pile of twinstacks which represents 
all of the elements currently received from the input but not yet sent to 
the output. In each iteration of the algorithm, we first receive the next 
element, i, from the input and place it in its own twinstack on top of the 
pile (the twinstack [i, —]). We then attempt to normalize the pile, in case 
the addition of this new twinstack caused the pile to no longer be normal. 

Normalization Step: Notice that the only element which could possibly 
be larger than any element in a lower twinstack is the new element i (since 
the pile would have been normalized during the previous iteration). Thus, 
we compare the element i to the top element (s) of the stacks of the second 
twinstack, resulting in one of the following cases: 

• If i is smaller than any top elements in the second twinstack, then 
the pile is already normalized, so return from the normalization step. 

• If i is smaller than one top element of the second twinstack, but 
larger than another top element, reverse the twinstack containing i 
in order to position it over the side of the second twinstack whose 
top element is larger than i and then weld. After the weld, the new 
top twinstack contains an element j which is larger than i. Since we 
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know that j is not larger than any elements in lower twinstacks, the 
whole pile must be normalized. Return from the normalization step. 

• If i is larger than one top element of the second twinstack and the 
other side of the second twinstack is empty, reverse the twinstack 
containing i in order to position it over the empty side of the second 
twinstack and then weld it. At this point, i may or may not be 
larger than some element in the new second twinstack (previously 
the third twinstack), so repeat the normalization step. 

• If i is larger than both top elements of the second twinstack, abort 
the algorithm (the given permutation is not sortable). 

Finally, after successfully normalizing the pile, we examine the top ele- 
ments of the top twinstack and move one of them to the output if it is the 
next element belonging there. If this causes the only nonempty stack of 
the top twinstack to become empty, remove it. Repeat this process until 
no more elements can be moved to the output. 

An execution of the Rosenstiehl-Tarjan algorithm has two possible 
results. One possibility is that the the algorithm returns false because it 
was at some point unable to normalize the pile of twinstacks. (Intuitively, 
this corresponds to determining that two elements j and k must be on 
opposite stacks (as represented by having them on opposite sides of the 
same twinstack), both of which are still in the stacks when a new larger 
element i arrives from the input. Whichever stack i is placed on, it must 
necessarily pin j or k underneath it, preventing that element from ever 
making it to the output.) The second possibility is that, after n iterations 
of the algorithm, every element has been moved from the input and into 
the output in sorted order. In this case the algorithm returns true. 

It is worth noting that we can also recover the sequence of operations 
for sorting a permutation which the Rosenstiehl-Tarjan algorithm deems 
sortable. However, for our purposes we are only interested in the boolean 
result telling whether or not the given permutation is sortable. 

4 The Rosenstiehl-Tarjan Algorithm for 
Deques 

After the main results of their paper, Rosenstiehl and Tarjan also pro- 
vided, as an aside, a modification of their algorithm to allow testing sorta- 
bility on deques. They write: 

We use the same algorithm as in the case of twin stacks, ex- 
cept that we process an element i larger than anything on the 
[pile of twinstacks] as follows. Add a new twinstack [i, —] to the 
bottom of the [pile of twinstacks]. If any twinstack [Li, Ri] has 
both Li and Ri nonempty, abort. Otherwise, reverse as neces- 
sary to make all the _R;s empty, and weld all the twinstacks in 
the [pile of twinstacks] . [7j 

The intuition behind this modification is simple. Whenever we add a 
new element from the input to a deque, it is safe to add that element as 
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long as the deque can be arranged in monotonic order. Clearly, if all of the 
twinstacks on the pile have one empty side, then there is an arrangement 
of elements which is monotonic. If the standard parallel stack sortability 
algorithm was used to add a new maximal element i at this point, this 
would result in the entire pile of twinstacks being welded together with i 
on one side, and all of the previous contents of the twinstack on the other. 
Clearly the deque is still monotonic, but this is no longer evinced by the 
absence of double-sided twinstacks. Thus if we were to subsequently add 
another larger element, we would fail during the normalization step. 

Rosenstiehl and Tarjan's approach, therefore, is to essentially tuck the 
element i underneath the side stack containing all of the other elements 
after welding. Thus the pile continues to contain only one-sided twin- 
stacks. 

We have found, however, that there is a small error in the modifica- 
tion that Rosenstiehl and Tarjan give in their paper. Notice that their 
algorithm will return false whenever a new maximal element is received 
from the input at a time when the pile contains a double-sided twinstack. 
Thus their algorithm depends on the invariant that the pile never contains 
a double-sided twinstack as long as it can represent a monotonic deque 
state. 

Whenever a normalized pile contains a double-sided twinstack apart 
from the bottom twinstack, then every possible state of the deque must 
necessarily be non-monotonic. Similarly, if the bottom twinstack is double- 
sided, and both sides contain more than one element, or there is an element 
which is larger than both top elements, then the deque must necessarily 
be non-monotonic. Therefore, the problem case we must watch to avoid 
is where only the bottom twinstack is double-sided, and one side of this 
twinstack contains a single element larger than every element on the other 
side. This is the only case where the pile can contain a double-sided twin- 
stack while simultaneously representing a monotonic deque state. 

The special treatment that the modification gives to the introduction 
of a new maximal element ensures that the algorithm never creates a 
double-sided twinstack so long as the deque remains monotonic. However, 
this does not protect against the case where the popping of an element to 
the output causes a deque to become monotonic. It is possible that, in a 
non-monotonic state, the pile can contain a double-sided twinstack. Then, 
by popping an element to the output, the state can become monotonic 
without removing the double-sided twinstack. 

To give a concrete example, consider the permutation 254163. This is 
clearly deque-sortable: 

Now consider running this through the stated version of the algorithm: 
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Figure 2: Sorting 254163 
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The problem here is clearly that popping the element 2 to the output 
results in the state becoming monotonic, even though the bottom (and 
only) twinstack remains double-sided. The fix for this problem is very 
simple. Whenever we pop an element from the bottom twinstack, we 
need to check if the resulting state is monotonic. If it is, we rearrange 
the bottom twinstack as necessary (by tucking the largest element at the 
bottom of the stack containing the other element) to make it one-sided. 

5 Correctness of Rosenstiehl-Tarjan-Modified 

Since Rosenstiel and Tarjan did not give a full proof of correctness of their 
algorithm for testing sortability on a deque, and since we have shown that 
some modifications need to be made to to fix this algorithm, it should be 
worthwhile to take the time to fully prove the correctness of the new 
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version of the algorithm which we will call Rosenstiehl-Tarjan-Modified. 
We intend to prove this by considering a mapping relating the states of 
a run of Rosenstiehl-Tarjan-Modified to the states of a sorting run on an 
actual deque. Therefore, we will begin by examining what these states 
are. 

The state of a deque switchyard can be thought of as consisting of 
three lists. The first list is the output, which contains all the elements 
which have already been popped from the deque in the order in which 
they were popped. The second list is the deque itself. This list contains 
the elements which, at the current point in the run, have already been 
pushed from the input onto the deque, but have not yet been popped. The 
final list comprising the deque switchyard state is the input list. This list 
contains a suffix of the input permutation consisting of all those elements 
which have not yet been pushed onto the deque. 

At all times, the combined three lists of the deque state contain all n 
elements of the input permutation n. The transition rules are governed 
by the four operations allowed in sorting on a deque. Whenever the input 
list is nonempty, we are allowed to take operation a, by removing the 
first element of the input list and adding it to the left end of the deque 
list. Alternatively, we can make a state transition by taking operation b: 
removing the first element of the input list and adding it to the right end 
of the deque list. The other two operations, y and z, involve removing 
the left or right end element of a nonempty deque list, and placing the 
removed element at the end of the output list. 

Let 2) be the set of all states of a deque switchyard containing n 
elements. Then each of the operations a, b, y, and z defines a map on a 
subset of 2) into 2). Alternatively, we can view 23 as the vertex set of 
a simple acyclic directed graph, where each vertex has between zero and 
four out-edges, labeled with the operations from {a, b, y, z} corresponding 
to the represented transitions. Note that some edges may have multiple 
labels since, for example, the operations a and b correspond to the same 
state transition whenever the deque list is empty. Alternatively, some 
operations may not be represented among the labels on the out-edges 
from some nodes. This is the case for the operations y and z for any state 
whose deque list is empty. Also note that this graph represents all possible 
states for a deque switchyard containing n elements. This includes many 
states in which the output contains elements which are out of order. 

We speak of a run of a permutation n on a deque switchyard to refer 
to a walk on this directed graph, starting at the the state where the out- 
put and deque are empty and the input list contains the full permutation 
7T. The run then consists of a series of states connected with edges each of 
which is labeled with at least one of the operations a,b,y, and z. A run 
is successful if it takes In steps and then ends at the unique state con- 
taining the identity permutation 1 ... n in the output list. A permutation 
7T is sortable on a deque if and only if there is a successful run of that 
permutation on a deque. 

Since the out-edges from each vertex of 2) are each labeled with a 
nonempty subset of {a, b, y, z}, for each run of a permutation jrona deque 
switchyard we can can construct a corresponding word in the alphabet 
{a, b, y, z} by choosing one letter from the label set for each edge along the 
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run. Furthermore, given such a word, we can easily determine whether it 
corresponds to a valid run (in terms of only selecting operations which are 
available at a given state). A word in the alphabet {a, b, y, z} represents a 
valid run of a permutation it or length n if and only if it contains at most 
n combined occurrence of the letters a and 6, and every prefix of the word 
contains at least as many combined occurrence of a and b as of y and z. 

def 

= We call a run of the permutation i on a deque switch- 
yard reduced if every state in which an element i at one of the 
ends of the deque is the next element required by the output 
is followed by a state in which that element i has been popped 
from the deque and moved to the output. 

We are interested in reduced runs because it is convenient to design 
our algorithms to only explore reduced runs, by sequentially moving one 
element from the input to the deque, and then moving elements from 
the deque to the output until we are unable to continue to do so. The 
following lemma shows that this choice is not restrictive. 

Lemma 5.1. Every permutation n which can be sorted on a deque can 
be sorted using a reduced run on a deque. 

Proof. Suppose that we are given some permutation, ir, that is sortable on 
a deque. Then there exists some successful run sorting that permutation. 
Call this run r. 

Let u be a word in the alphabet {a, b, y, 2} which corresponds to r. 
(Recall that there can be multiple such words, but there must always be 
at least one such word.) Suppose that r is not a reduced run. Then 
there is some element, i, which is not moved to the output as soon as 
possible. However, since r is a successful run, i must be moved to the 
output eventually. 

Let u>j be the letter in the word u> which corresponds to the state 
transition wherein i is moved to the output, ujj must be either y or z. 
Suppose that ujj = y. Consider the first opportunity to move i to the 
output. At no point in-between then and the step corresponding to uij 
can there be any element to the left of i. This is because, as soon as we 
are ready to move i to the output, all elements from 1 through (i — 1) 
have already been moved to the output, so there will not be any y or z 
operations preceding the one which outputs i. Since the left side of i is 
free at the time corresponding to cjj , it must have been free for the entire 
intervening period. Thus we can construct a new run, r' , by moving the y 
operation which takes i to the output to the earliest possible opportunity. 
(The same result holds for ojj = z by symmetry.) 

The new run, r' , has one fewer elements which is not popped at the first 
opportunity than r did. We can repeat this process until we arrive at a run 
which has no elements which are not popped at the first opportunity. This 
resulting run is reduced. Therefore, the arbitrary sortable permutation n 
can be sorted using a reduced run. □ 

We would also like to consider what can cause a run to not be suc- 
cessful. Notice that, whenever the state of the deque switchyard is such 
that there is an element i on the deque which is sandwiched between two 
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larger element, then there is no successful run including this state. This is 
because one of the two sandwiching elements must be moved to the output 
before i can be moved, but this will result in the output being unsorted. 
We call such states sandwich states, and we seek successful runs among 
those runs which avoid these sandwich states. 

We are now ready to consider the states of the Rosenstiehl-Tarjan- 
Modified algorithm, and the mapping which takes them to reduced runs on 
the deque. The state of Rosenstiehl-Tarjan-Modified also consists of three 
parts. The output is again a list of elements which have been popped in 
the order in which they were popped. The input is again a list containing 
a suffix of 7r with all those elements which have not yet been pushed. 
Instead of a deque list, however, the third element of the Rosenstiehl- 
Tarjan-Modified state is the stack of twinstacks called the pile. Like the 
states of the deque switchyard, the stacks and lists of the Rosenstiehl- 
Tarjan-Modified algorithm always contain all n permutation elements. 
Let SH denote the set of all possible states of the algorithm. 

Here is the psuedocode of the Rosenstiehl-Tarjan-Modified algorithm: 

Procedure Fromlnput (O, P, I) 

x = I.dequeueQ 

add the new twinstack (x, — ) to the top of P 
return 
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Procedure Normalize (O, P, I) 

topTStack = P.topQ 
secondT Stack — P.secondQ 
if secondTStack=—NIL then 

L return TRUE 
x = topT 'Stack. leftQ.topQ 

/* note that x is the only element in topTStack which can 
possibly be greater than some element in secondTStack 
*/ 

switch do 

case secondT Stack, right () nonempty and x less than both top 
elements of secondTStack 
return TRUE 

case secondTStack. right () nonempty and x in-between the two 
top element of secondTStack 

weld down 

return TRUE 

case secondT Stack, right () nonempty and x greater than both 
top elements of secondTStack 

return FALSE 
case secondTStack. right () empty and x less than 
secondTStack. left().top() 

return TRUE 
case secondT Stack, right () empty and x greater than 
secondTStack. left().top() 

if secondTStack is the bottom twinstack and x is larger 

than every element in secondTStack then 

place x at the bottom of the nonempty side of 
secondTStack 

reverse topTStack and weld down 
else 

reverse secondTStack and weld down 
return Normalize (O, P, I) 
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Procedure ToOutput (O, P, 7) 

topTStack = P.topQ 

if topTStack.right().top()==0.last()+l then 
O . enqueue (topTStack .rightQ .pop() ) 

reverse topTStack if necessary to put the largest top element 
on the left stack 

if the pop was from the bottom twinstack and it caused that 
twinstack to become monotonic then 

reorganize topTStack to be one-sided, reflecting its 

monotonicity 

ToOutput (O, P, 7) 
return 

else if topTStack.left().top()==0.last()+l then 
O . enqueue (topTStack -leftQ .pop () ) 
pop topTStack if it is now empty 

otherwise reverse topTStack if necessary to put the largest top 
element on the left stack 
ToOutput CO,P, I) 
_ return 

else 
L return 



Procedure Rosenstiehl-Tarjan-Modif iedCO, P, 7) 
while not I.isEmpty() do 
FromInput(0,P,7) 
if not Normalize (O, P, 7) then 

L return FALSE 
ToOutput (O, P, 7) 

return TRUE 

As discussed previously, the idea behind the Rosenstiehl-Tarjan-Modified 
algorithm is that we simultaneously represent several possible states of a 
deque sorting attempt by representing degrees of freedom with the ability 
to reverse each twinstack on the pile independently. Each twinstack is 
meant to hold elements which belong "outside of" the elements in the 
twinstacks below it. If we were to construct an actual deque list from 
the pile of twinstacks, we would choose an orientation for each twinstack. 
Starting with the empty deque and the bottom twinstack, we add the 
elements from the left side of the twinstack to the left side of the deque 
and the elements from the right side of the twinstack to the right side of 
the deque if we selected the default orientation. Alternatively, if we select 
the reversed orientation, then we put the left stack elements on the right 
side of the deque and the right stack element on the left side of the deque. 
Clearly, the orientation that is chosen for the bottom twinstack may or 
may not matter (depending on whether the bottom twinstack contains 
one or several elements), but each change of orientation for a non-bottom 
twinstack results in a different deque state. 

We use the term realization to refer to this process of choosing orienta- 
tions for the twinstacks in the pile and turning them into a deque. Every 
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pile containing 2 k twinstacks can be realized as either 2 k or 2 fc ~ 1 different 
deques, depending on whether the bottom twinstack contains exactly one 
permutation element. Since we require every twinstack to be nonempty, 
there can only be a maximum of n twinstacks (and if there are n twin- 
stacks, then the bottom twinstack contains exactly one element). Thus 
every pile can be realized by at most 2"~ 1 different deques. This process 
of realization is the mapping which we will use to prove the correctness 
of the Rosenstiehl-Tarjan-Modified algorithm. Let be a mapping 

<f>: (Z/2Z) 11 - 1 xK^S 

defined by choosing an orientation for every stack starting with the bot- 
tommost stack whose orientation matters (the bottommost stack if it has 
two elements and the second from the bottom otherwise), and then com- 
bining their elements as described into a single deque list. 

def 

= We call cf> the realization mapping . We say that a state 
d G 55 is a realization of r G 9i if there exists some a G 
(Z/2Z) n_1 such that d = c/>{a,r). 

Just like we define runs of a deque switchyard as walks on a graph 
with vertices in D, we would like to define runs of the Rosenstiehl-Tarjan- 
Modified algorithm to be walks on a simple acyclic directed graph with 
vertices in fR. The edges, in this case, represent the states transitions that 
can be accomplished during the course of execution of the Rosenstiehl- 
Tarjan-Modified algorithm. (Since the algorithm is deterministic, every 
vertex has at most one out-edge, and the entire run is determined by the 
choice of starting vertex.) Once again, a successful run will be a walk 
starting with the permutation it in the input list, and ending after 2n 
steps with the identity permutation in the output list. 

We now generalize the notion of a realization from single states to 
entire runs. 

def 

= We say that a deque run do, . . . , dy is a realization of a 
Rosenstiehl-Tarjan-Modified run ro, . . . , r k if it is a valid deque 
run, and there exists a sequence of binary numbers ao, . . . , a k G 
(Z/2Z) n_1 such that the sequence <j> (ao, ro) , ■ ■ ■ , 4> (at, rjt) is 
equal to do, ■ ■ ■ , dy except possibly with repetitions of states. 

Lemma 5.2. If r = ro, 7*2, . . . , ru is any run of the Rosenstiehl-Tarjan- 
Modified algorithm starting with it in the input and ending after k steps 
at r k G SH, then for any a G (Z/2Z) n_1 there exists a realization of r 
which ends at the state 4> (a, r k ) G 2). 

Proof. We give a proof by induction on k. 

base case (k = 0): Clearly the one and only realization of the Rosenstiehl- 
Tarjan-Modified run of trivial length starting with input tt is mapped to 
the one and only valid deque run of trivial length starting with that input 

7T. 

inductive case: Assume that the statement holds for k — 1 steps. Con- 
sider the kih step of run r. The Rosenstiehl-Tarjan-Modified algorithm 
can bring about change in its state in a couple of ways. 
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1. A new twinstack could be added to the top of the pile by the Fromln- 
put subroutine. 

2. An element can be moved from the pile to the ouput (possibly with 
the removal of the containing twinstack). 

3. The top twinstack could be welded down. 

4. A twinstack could be reversed. 

5. The single largest element of the bottom twinstack, which is cur- 
rently residing as the only element in the stack in one side of that 
twinstack, can be moved to the bottom of the other stack of that 
twinstack. (This can occur in the ToOuput and Normalize subrou- 
tines.) 

The last two possibilities, reversing a twinstack and rearranging the bot- 
tom twinstack, do not affect which states of the deque switchyard can be 
realized. Therefore, the exact same run of the deque switchyard which 
realizes ro, ■ ■ ■ , ru-i can realize ro, ■ ■ ■ , r k by repeating the last state of 
the realization. 

In the third possibility, where the top twinstack is welded, the states 
of the deque switchyard which are realizable from r k are a subset of those 
realizable from r k -i- Thus, for every realization of r k , there is already 
a run of the deque realizing ro, - - - ,rk-i which ends at that state. This 
can be extended to realize ro, . . . ,r k by repeating the last state of the 
realization. 

In case 2, we are moving one element, x, from the pile to the out- 
put. By the definition of realization, it is clear that every realization of 
ro, . . . , r/t-i ends with a deque switchyard state having the element x as 
one of the ends of the deque. Let S pop C D denote the set of states which 
can be transitioned to from realizations of r k -i via a pop operation send- 
ing x to the output. If any of these is not a realization of r k , then the 
state it is a transition from must not be a realization of rfc_i, a contradic- 
tion. Thus Spop is a subset of the realizations of r k . Our goal is to show 
that it is equal to the set of realizations of r k , since this would mean that 
there is a transition from a realization of r k -i to a realization of r k which 
can be appended to a specific realization of ro, . . . , r k -i (by the inductive 
hypothesis) to give the full desired realization of ro, . . . , r k - 

In the trivial case, where x was the only element in the pile, there is 
only one realization of must be nonempty, so this implies that 

Spop is equal the set of all realizations of r k . 

Now consider the case where the pile contains at least one element 
besides x. We know from the inductive hypothesis that there are real- 
izations of ro, . . . ,r k -i ending at every possible realization of r^-i. Let 
m k -i denote the number of such possible realizations. 

If the moving of the element x to the output causes the top twinstack to 
be popped, then the number of different realizations of r k is mfc 2 -1 . Notice 
that no state in D ever has more than two incoming edges corresponding 
to pop operations sending a fixed element x to the output. Therefore, the 
cardinality of 5 pop is greater than or equal to mfc 2 -1 . So Spop must equal 
the set of all realizations of r k . 
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Alternatively, if the movement of x to the output did not cause the 
top twinstack to be popped, then the deque lists of every realization of 
Vk-i must still be unique after the removal of x. Therefore, the cardinal- 
ity of Spop must be rrik-i, implying that it is equal to the whole set of 
realizations of r k . 

Finally, in case 1 we are adding a new twinstack containing the ele- 
ment x to the top of the pile. Consider an arbitrary a giving an arbitrary 
realization of rk- By the inductive hypothesis, there is a realization of 
ro, . . . , r fc _iending in <f> (a, r k -i). But <f> (a, ru-i) is a state of the deque 
which can clearly transition to <j>(a,rk) by taking either an a or 6 opera- 
tion. Thus there is a realization of ro, . . . , r k ending at d k = <j> ( a > r fe)- 

Thus, if r = ro, ri, . . . , r k is any run of the Rosenstiehl-Tarjan-Modified 
algorithm starting with 7r in the input and ending after k steps at r k £ £K, 
for every realization of r k , there exists a valid run of the deque switchyard 
which realizes ro, . . . , r k and ends at that particular realization of r k . □ 

Lemma 5.3. After every run of the Normalize subroutine which returns 
true, every realization of the state of the Rosenstiehl-Tarjan-Modified al- 
gorithm is a non-sandwich state. Additionally, when the Normalize sub- 
routine returns false, this is because every realization of the state at the 
start of that subroutine was a sandwich state. 

Proof. Suppose for the sake of contradiction that the Normalize subrou- 
tine returns true, and that there is a realization which is a sandwich state. 
The only way that there could be a realization as a sandwich state is if 
some realization puts an element i between two larger elements, j and k. 
Consider the empty deque list, to which we begin adding elements from 
the stacks of our twinstacks in order to form a realization. In forming a 
realization, each element must go either to the left or right of this initial 
empty list. Clearly, i cannot go on the opposite side from j and k while 
still appearing between them in the realization. Thus at least one of j, k 
must go on the same side as i and on the outside of i with respect to the 
position of the initial empty list. 

Assume without loss of generality that the element i is located between 
the initial empty stack and the element j in the realization as a sandwich 
state. Then j must have been located either above i in the same stack 
of the twinstack containing i, or j must be in a twinstack above the one 
containing i. But this is a contradiction, since it is clear that the pile 
is normalized every time the Normalize subroutine of Rosenstiehl-Tarjan- 
Modified returns true. Thus, Normalize cannot return true unless every 
realization of the algorithm state is a non-sandwich state. 

Now we wish to show that, when the Normalize subroutine returns 
false, every realization of the state at the start of that subroutine call was 
a sandwich state. Suppose that Normalize returns false. Then, at the time 
of the first execution of the return, the second twinstack is double-sided, 
and the top elements of both of its sides are smaller than an element 
x in the top twinstack. The previous calls of Normalize, if any, only 
modified the state by welding together elements of the top two stacks. 
Thus, when Normalize was first called, this double-sided twinstack was 
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already double-sided, and the element x was already in a twinstack above 
the double-sided twinstack. Call this state r G 91 

By making sure that the bottom twinstack is always one-sided if it 
is monotonic, we ensure that the presence of a double-sided twinstack 
ensure that any realization of that twinstack and all those below it must 
be non-monotonic. Thus, any realization of the state r must have the top 
elements of the double-sided twinstack separated by an element larger 
than both of them. So, wherever the realization places x, it will sandwich 
one of these top elements of the double-sided twinstack. Therefore, every 
realization of r is a sandwich state. □ 

Theorem 5.4. The Rosenstiehl-Tarjan-Modified algorithm is correct. 
That is, it returns true for a permutation n if and only if there is a valid 
run of a deque switchyard which sorts the permutation n. 

Proof. Suppose that the Rosenstiehl-Tarjan-Modified algorithm 

returns true. Then, at the time of return, its input list must be empty. 

We claim furthermore, that at the time of return, every element of the 
permutation is in the output list in sorted order. Clearly every element 
that is in the output list must be in sorted order. Suppose, for the sake 
of contradiction, there are elements remaining in the pile at the time of 
return. Since the pile was normalized prior to the final call to ToOuput, 
the smallest element not on the output must have been one of the top 
elements of the top twinstack. But this would imply that it would have 
been moved by ToOuput. Thus we have a contradiction, and the pile must 
be empty at the time when the algorithm terminates with the return value 
true. Therefore, at the time of return, every element of the permutation 
is in the output list in sorted order. 

At the time of return, the states taken by the algorithm represent 
a run of Rosenstiehl-Tarjan-Modified staring with the state where it is 
in the input, and ending with the state where 1 . . . n is in the output. 
By the previous lemma, every such run is realizable as a run of a deque 
switchyard, with realizations for each possible realization of the final state 
of the Rosenstiehl-Tarjan-Modified run. Any such realization in this case 
represents a successful sorting of tt. Therefore 7r is deque sortable. 

(^=>): Now suppose that n is deque sortable permutation. The one 
remaining result we need to show that Rosenstiehl-Tarjan-Modified will 
return true is the following subclaim. 

Subclaim. Suppose a run of Rosenstiehl-Tarjan-Modified transitions from 
a state r € 5H to r' G 9t. Let d be any realization of r which is not a sand- 
wich state, and let d! be any non-sandwich state reachable by taking a 
valid step from d as part of a successful reduced run of the deque switch- 
yard. Then, either d is a realization of r' , or d! is a realization of r' . 

Proof. Once again, we consider all possible state transitions of the Rosenstiehl- 
Tarjan-Modified algorithm. 

1. A new twinstack could be added to the top of the pile by the Fromln- 
put subroutine. 
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2. An element can be moved from the pile to the output (possibly with 
the removal of the containing twinstack). 

3. The top twinstack could be welded down. 

4. A twinstack could be reversed. 

5. The single largest element of the bottom twinstack, which is cur- 
rently residing as the only element in the stack in one side of that 
twinstack, can be moved to the bottom of the other stack of that 
twinstack. (This can occur in the ToOuput and Normalize subrou- 
tines.) 

In the last two cases, and realization before the transition is still realizable 
after the transition, so the statement holds. 

In case 3, the only realizations being eliminated by the weld are those 
where some large element j would have gone on the outside of a smaller 
top element i. Such realizations are always sandwich states unless i is the 
only element in all twinstacks below the one containing j. This situation 
would be handled by the 5th case instead, so we can safely conclude that 
any realizations eliminated by the weld are sandwich states. 

Now consider case 2. In this case we are popping an element which 
can be placed on the output. According to the definition of a reduced 
run, the only step we could possibly be taking from any realization d of r 
would be to pop this element. Therefore, every d! which is reachable by a 
valid step of a reduced run of the deque switchyard is a realization of r' . 

Finally, consider case 1. Here, we are transitioning from r to r' by 
pushing a new element from the input onto the pile as its own new twin- 
stack. Since the Fromlnput call involved in this transition from r to r' was 
immediately preceded by a call to ToOuput, the state r cannot have any 
elements which are available to be popped. Therefore, the only possible 
transitions from d to d' as part of a successful reduced run on the deque 
switchyard are caused by the operations a and b, and the two resulting 
states are both realizations of r' . 

Therefore, in every case, if d is any realization of r which is not a 
sandwich state, then either d is a realization of r' , or every d! which is 
reachable by a valid step from d as part of a successful reduced run of the 
deque switchyard is realizable from r' . □ 

Since n is deque sortable, there must be a successful run of the deque 
switchyard starting with n in the input and ending with the identity per- 
mutation in the output. By our lemma, there must also be a successful 
reduced run. Denote this successful reduced run by d = do, ■ ■ ■ , di n - 

Because the initial state do is a realization of the initial state of the run 
of Rosenstiehl-Tarjan- Modified on the permutation it, the above subclaim 
shows that, as we run the Rosenstiehl-Tarjan-Modified algorithm, every 
state do through dk will be included as realizations of this run, where dk 
is the state which is a realization of the current state of the Rosenstiehl- 
Tarjan-Modified at the time of termination. 

Suppose for the sake of contradiction that the Rosenstiehl-Tarjan- 
Modified were to return false. Then our lemma shows that every real- 
ization of the current state of the Rosenstiehl-Tarjan-Modified algorithm 
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must be a sandwich state. But dk is a realization which is not a sand- 
wich state, which gives a contradiction. Therefore the Rosenstiehl-Tarjan- 
Modified algorithm must return true. 

So the Rosenstiehl-Tarjan-Modified algorithm returns true for a per- 
mutation 7r if and only if there is a valid run of a deque switchyard which 
sorts the permutation tt. □ 

6 Applying Rosenstiehl-Tarjan-Modified 
to the Enumeration Problem 

The naive approach to calculating the number of sortable permutations 
of a given length n would be to enumerate all permutations of length n 
and then test each one for sortability using the appropriate version of the 
Rosenstiehl-Tarjan algorithm. This approach has the ghastly runtime of 
e(n-n!). 

A much better approach is to search the permutation tree, pruning 
subtrees of permutations which are not sortable. Consider the tree where 
each node at depth k is a permutation of length k, and its children are the 
permutations of length (k + 1) formed by inserting the element (k + 1) at 
the (k + 1) possible insertion locations. 



12 21 




123 132 312 213 231 321 




Figure 3: Searching the permutation tree 

Clearly, this tree is the poset of all permutations (oriented upside 
down, and with only some of the connections shown). Since permutations 
classes are closed under downward containment in the poset of permuta- 
tions, they are closed under upward traversal of this tree. Therefore, any 
node of the tree whose permutation does not belong to a given permuta- 
tion class is the root of a subtree which does not contain any members 
of that class. We use this property to prune a depth first search of the 
permutation tree for sortable permutations of length n. 

The algorithm for calculating \C n \ or \T> n \ is thus given as follows: 

• Start at the root of the permutation tree and traverse it via depth 
first search. 

• At each node, test the permutation for sortability using the appro- 
priate version of the Rosenstiehl-Tarjan-Modified algorithm. If the 
permutation is not sortable, backtrack. 
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• Whenever a permutation of length n is found to be sortable, incre- 
ment the number of sortable permutations, then backtrack. 

The runtime of this algorithm clearly depends on the number of nodes 
visited. Since every node visited is the child of some node at the previous 
depth whose permutation is sortable, and each node at depth (k — 1) has 
k children, the number of nodes visited is given by the following formula 
(for the parallel stack sortability case). 

n 

# nodes visited = i |Cj-i| where we define \Co\ = 1 

i=l 
n 

<n^(gr(C)r 1 
i=i 

n-l 

= n^(gr(C)r 

i=0 

(gr(C))"-l 
gr (C) - 1 

= 0(n.(gr(C))") 

Combining this calculation with the linear runtime at each visited 
node, we see that the runtime of this enumeration algorithm is O (n 2 ■ (gr (C)) 
(Similarly, for the deque sortability case we derive a runtime of O (n 2 ■ (gr (£>)) 

Modulo the sub-exponential factor, this is optimal among algorithms 
which must consider every sortable permutation. This is asymptotically 
superior to the runtime of Zimmermann's C program (which can be though 
of as having a runtime of O ((gr (T>) + A) n ) for some positive constant such 
that gr (T>) < (gr (D) + A) < 16), though the runtime difference is not 
sufficient to change the range of values of n for which the calculation can 
be reasonable performed. We wrote an efficient C implementation of this 
algorithm which calculated the first 14 terms of the sequences \Ci \ , IC2I , . . . 
and \V\ I , \T>2\ , • • • (the same terms that Zimmermann calculated with his 
algorithm) . 

We now transition to the construction of a new algorithm whose op- 
eration does not depend on examination of each sortable permutation. 

7 The Relativistic Algorithm for Count- 
ing Parallel Stack Sortable Permutations 

As with the Rosenstiehl-Tarjan-Modified algorithm itself, we will develop 
the new algorithm by first considering the simpler case of computing |C„|, 
and then progressing to a version of the algorithm which can calculate 

Let us define a run of the Rosenstiehl-Tarjan-Modified algorithm as 
before, as a walk on the state graph whose vertices are the elements of 
91. Let us call a successful run of the algorithm an R-T history . The key 
idea behind the new algorithm is that, instead of counting the number of 
parallel stack sortable permutations directly, we can count histories of the 
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Rosenstiehl-Tarjan-Modified algorithm. We demonstrate the equivalence 
of these two approaches with the following lemma. 

Lemma 7.1. The set of histories of the Rosenstiehl-Tarjan-Modified al- 
gorithm for parallel stacks (respectively deques) is in bijection with the 
set of parallel stack sortable permutations (respectively deque sortable 
permutations). 

Proof. (D): Different sortable input permutations always produce distinct 
runs, and since Rosenstiehl-Tarjan-Modified is correct, they produce dis- 
tinct histories. Therefore, there are at least as many R-T histories as 
there are sortable permutations. 

(C): Consider any set of distinct R-T histories. Since the R-T al- 
gorithm is deterministic, they must differ in their first state, and thus 
in there input permutation. By the correctness of Rosenstiehl-Tarjan- 
Modified, each of these permutations must in fact be sortable. Therefore, 
there are at least as many sortable permutations as there are R-T histo- 
ries. □ 

Counting R-T histories is still not an easy task. The state space SH is 
still very large and complex. We (Doyle) noticed, however, that there is a 
great deal of symmetry between states whose pile of twinstacks have the 
same relative orders of elements. Thus, we are going to consider an new 
state space designed to better take advantage of these symmetries. 

= Let the relativistic twinstack (or an r-twinstack) cor- 
responding to a given twinstack be the structure obtained by 
forgetting all of the labelings of the elements and remembering 
only their relative orders. (So an r-twinstack containing k ele- 
ments can be represented by a binary string of length k, where 
the ith smallest element is in the left stack if and only if the 
ith number in the string is a 1.) 



Figure 4: An example r-twinstack and one classic twinstack mapping to it 

Short aside: Rosenstiehl and Tarjan require their twinstacks to be 
nonempty, but we will see that it is convenient for us to also consider "the 
empty r-twinstack" . 

Notice that almost all r-twinstacks correspond to several different twin- 
stacks. Additionally, the possibilities for an r-twinstack with k elements 
involved in the sorting of a permutation of length n are the same as the 
possibilities for an r-twinstack involved in the sorting of a permutation 
of length m as long as k < n, m. Thus, by rephrasing our state space in 
terms of r-twinstacks, we make it much easier to phrase the counting of 
histories in terms of subproblems. This motivates the following definition. 
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= Let a r-state consist of a pile of r-twinstacks, along with 
a count of the number of elements in the input and the number 
of elements in the output. 

There is a natural surjective mapping from states of the Rosenstiehl- 
Tarjan-Modified algorithm to r-states. Let the set of all r-states be de- 
noted £. Just as we did for the states in $R, we will consider the simple 
directed graph formed by valid state transitions of the Rosenstiehl-Tarjan- 
Modified algorithm, except that we consider a transition from one r-state 
to another to be valid if and only if there is a pair of states in SH which 
map to the r-states and between which there is a valid Rosenstiehl-Tarjan- 
Modified state transition. Notice that just as the correspondence between 
r-states in £ and states in £R can be one-to-many, the correspondance 
between edges in the transition graph on vertex set <£ and the transition 
graph on vertex set 5H can also be one-to-many. 

Furthermore, while the transition graph on 5H had at most one out- 
edge from any vertex, a single vertex in the transition graph on £ can 
have many out-edges since there can be many possible transitions from 
a given r-state depending on the relative order of the new element taken 
from the input. 

Example: 



If the new element stays 




If the new element is immediately popped 



Figure 5: Example of possible transitions from a given r-state 
We next go on to give the analog of our definition of an R-T history. 
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= An r-history is a length n path in the r-state graph on 
vertex set <£ which has a lifting to the state graph on vertex set 
9^ as an R-T history (or as a successful run of the Rosenstiehl- 
Tarj an- Modified algorithm). 

The next result is slightly more surprising than the bijection between 
R-T histories and sortable permutations. 

Lemma 7.2. The set of r-histories on the transition graph of r-states 
with n elements is in bijection with the set of sortable permutations of 
length n. 

Proof. Suppose we are given an r-history, a. Notice that the final r- 
state of a is the unique r-state with n elements in the output. Call this 
state x. Since every r-history lifts to an R-T history, and every R-T 
history ends with the identity permutation in the output, we can label 
each of the elements in the output of the r-state x so that they form the 
identity. Then, by following a in reverse, it is possible to track the labels 
on the elements as they propagate back from the output, into the pile of 
r-twinstacks, and into the input. 

Thus, by running a in reverse, we can determine the unique input 
permutation which could have lead to the that r-history. Since the inverse 
of this map is the map taking a sortable permutation to the R-T history 
that it generates and then mapping that R-T history to an r-history in 
the natural way, this map is clearly a bijection between r-histories and 
sortable permutations. □ 

Lemma 7.2 implies that, instead of trying to count either the number 
of sortable permutations or the number of Rosenstiehl-Tarjan-Modified 
histories, we can simply count the number of r-histories. This will prove 
much easier, since the set r-states in (£ for permutations of size m < n are 
a subset of the set of states for size n, modulo differences in count of the 
number of elements in input and output. This is the gist of the subproblem 
decomposition which will be used for the relativistic algorithm. In order 
to formalize this decomposition, however, we would like to introduce a 
few more concepts. 

We propose to let an epoch be a section of a r-history which is tied 
to a certain level of the stack of r-twinstacks. The base epoch, associated 
with the lowest r-twinstack, will be the r-history itself. We might then 
associate one or more epochs with the second r-twinstack, and still more 
with the r-twinstack above that, etc. 

An epoch £ at a higher level can be viewed as a subpath of the total 
r-history. However, we say that E has its own perspective from which it 
sees itself as the base epoch at the level of the bottom twinstack. Thus E 
views itself as the r-history created by taking the subpath of the original 
r-history and removing from each of the states in this sequence all of the 
r-twinstacks below E's level. 

Let us now give a formal definition of an epoch. 

def 

= Given an r-history (which is a path in the transition 
graph on the set of r-states £), an epoch at level h (for h > 0) 
is defined as a "subpath" x' y' s.t.: 
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• The epoch begins at some r-state x where the r-twinstack 
at level (h — 1) is nonempty and the r-twinstack at level 
h is empty. 

• The epoch ends at the r-state y which is the first r-state 
following x in the path such that the r-twinstack at level 
(h — 1) is modified in the transition to r-state y. 

And where the r-states in the "subpath" x' y' are 

created from the actual subpath x —>■...—► y by removing all 
of the r-twinstacks for levels through (h — 1) and by decre- 
menting the input and output element counts such that the 
output count is zero at x' and the input count is zero at y' . 
(When h = 0, the starting and ending conditions are waved, 
and we say that there is a single epoch corresponding exactly 
to the r-history.) 

Each epoch is itself an r-history for the sorting of some smaller per- 
mutation. (Notice that the pile of r-twinstack is always empty at the end 
of the epoch because, if the epoch is not the original base epoch then it 
must end when the r-twinstack just below its level is modified by either a 
pop or weld operation, and in either case the r-twinstack at its level must 
be empty.) Furthermore, since no epoch can end before any epochs above 
it end (by the requirment of having empty r-twinstacks at the end of an 
epoch), the epochs are properly nested. Every epoch at level (h — 1) can 
contain zero or more epochs at level h. 

To make the epoch decomposition of a given epoch/r-history well de- 
fined, we say that an epoch begins at level h any time that the current 
r-state has a nonempty r-twinstack at level (h — 1) and there is not already 
an existing epoch at level h. Thus every epoch which includes an r-state 
with a non-empty r-twinstack must contain one or more child epochs at 
the next higher level, while any epoch which includes only r-states with 
empty r-twinstacks has no child epochs. 

( ) 
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Figure 6: Example decomposition of an r-history into epochs 
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It should be clear that this nested tree of epochs, each corresponding to 
a smaller r-history, gives a subproblem decomposition which we can use to 
address the task of calculating the number of r-histories of a given length. 
Before we finally address this directly, however, allow us to introduce one 
more key definition relating to the ways in which an epoch can end. 

An epoch always ends either by popping all of its elements and then 
popping an element of the r-twinstack below that epoch, or by welding all 
of its elements onto the r-twinstack below that epoch. (The exception is 
the original bottom epoch, or root epoch, which ends where it pops all of 
its elements and there are no elements remaining in the input.) Thus, the 
ways in which the r-twinstack belonging to a given epoch can be modified 
are tied to the ways in which the epoch at the next level above can end. 

Recall that every epoch/r-history can be lifted to some successful R-T 
history. Since, whenever we weld in an R-T history there can only be 
one element i which is too big and is thus preventing the pile from being 
normal, this is also the case in epochs/r-histories. Thus, whenever we weld 
k elements onto some r-twinstack t, we know exactly what form they will 
have. There will be one large element i which is larger than some element 
in t, and on the opposite side will be welded k — 1 smaller elements which 
are smaller than every element in t. Thus the integer k > 1 completely 
describes the possible transitions that can be caused by the weld. 

def 

= We say that at the end of every epoch E, E sends a 
signal k to the epoch below it, with k = if E ends by popping, 
or k equals the number of elements being welded down if E ends 
by welding. 

The information sent as a signal by an epoch E at its termination 
is exactly what is needed to determine the ways in which the r-state can 
change at the epoch below E when epoch E ends. We can also think of an 
epoch as sending a signal even at some times when it is not terminating, 
whenever it presents the opportunity for the epoch below to modify its 
r-twinstack. Namely, we can view an epoch E as sending k — whenever 
it pops every element from its r-twinstack, and as sending the signal k > 
whenever it places a new element i at the very bottom of its twinstack 
(either through welding or through pushing from the input) where k is 
the number of elements in its r-twinstack. 

Viewed in this way, the epoch below E is presented with opportunities 
to modify its twinstack. It may ignore some of these signals (leaving its 
r-twinstack unchanged and the epoch E unterminated). At some point, 
though, the epoch bellow will accept one of these signals, modify its r- 
twinstack, and end epoch E. 

We are now ready to present the subproblem definition which we will 
use for the relativistic enumeration algorithm. Let us define a new map 

h (m, k) 

to count the number of r-histories (alternatively the number of epochs) 
which take m steps and then end by sending signal k. 

More generally, we want to consider starting not just from the r-state 
with the empty pile of r-twinstack, but from the r-state whose pile contains 
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exactly one r-twinstack S which may or may not be empty. Thus we let 

h (S, m, k) 

be the map counting the number of "r-histories" starting with r-twinstack 
S, taking m steps, and then ending signal k. (We place r-histories in 
quotes because, formally, we defined r-histories to only start with the 
empty r-state.) 

Suppose we can give an efficient recursive calculation for h (S,m, k). 
(We will soon do so.) Then we will have solved the problem of enumerating 
the number of parallel stack sortable permutations of length n, since this 
is equal (by Lemma 7.2) to the number of r-histories of length n which end 
with all elements in the output, and these are counted by h ( ( ) , n, 0) 

(where ( ) denotes the empty twinstack). 

Therefore, all that remains is to show how to compute h (S, m, k) re- 
cursively. The recursion will be on the number of steps, m. 

Recursive Case (m > 1): 

Consider first the case where S = ( ). Clearly, the first step can 

transition to one of two different states. Either the state whose r-twinstack 
contains one element, ( • ) , or (with an immediate pop to the output) 
the state whose r-twinstack is still empty, ( ) . Therefore, 

h(( ) ,m,k) = h (( • ),m-l,k)+h(( ),m-l,k) 

Now consider the case where the r-twinstack S is not the empty r- 
twinstack. There are two possible subcases. The first subcase is where 
the r-twinstack S first changes to some r-twinstack S' 7^ S after i steps 
for some integer i less than m. This change must correspond to a signal, 
j, received from the epoch ending when the change occurs. 

( I 

' ii 

S S ... S S' 

i steps 

Figure 7: Example of signal passing in the case where the nonempty r-twinstack 
S changes after i steps for some integer i less than m 

For each selection of 1 < i < m, once can consider every possible signal 
j that could be recieved (since each such j corresponds to a distinct subset 
of the possible epochs at the next level beginning at the start and lasting 
for i steps). Then, for each possible signal j, there may be many possible 
S's that are reachable from S given that signal. Thus the number of 



epochs belonging to this subcase is given by. 
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The second subcase is when the epoch's r-twinstack, S, remains un- 
changed all the way till then end of the epoch. Finally, after m steps, the 
epoch must receive a signal k' from its child epoch which causes it to send 
signal k to its parent epoch. 




s s ... s 




m steps 



Figure 8: Example of signal passing in the case where the nonempty r-twinstack 
S remains unchanged up till the end of the epoch 

Because the epoch is expecting to send signal k immediately upon 
receiving signal k' , the set of signals that it can receive and still accomplish 
this with is very limited. For example, if the epoch intends to send the 
signal k — 0, then this cannot be accomplished if the received signal k' is 
nonzero (a weld signal). However, this can always be accomplished if k' 
is equal to zero. Thus, for k = 0, k' must be uniquely determined to also 
be zero. 

When, on the other hand, the signal k is greater than zero, this indi- 
cates that the epoch ends by welding k elements. A welding end to the 
epoch clearly cannot occur if the signal received and accepted from the 
child epoch is k' — (since that would indicate that we pop from S rather 
than welding its elements down). If k' is a weld signal, then the set of 
elements welded down from the epoch must include every element it S 
along with each element counted by k' . Therefore, the only value which 
could possibly work for k' is the difference k — \S\ (where 15*1 naturally 
denotes the number of elements in r-twinstack S). Notice that whenever S 
is one-sided, it is always possible to send signal k upon reception of signal 
k' = (k — \S\). Alternatively, whenever S is double-sided, it is impossible 
to send a weld signal, since this requires receiving a new element i which is 
the largest in the r-twinstack S which would cause the sorting algorithm 
to fail (and we are only considering successful r-histories of the sorting 
algorithm). Thus k' , if it exists, is uniquely determined as a function of 
S and k. By choosing k' = — 1 when no k' could allow us to send signal 
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k, we get the following formula for k! (S, k). 



k'(S,k) 



if k = 

\S\ — k if S is one-sided and \S\ < k 
— 1 otherwise 



The number of epochs belonging to this second subcase is thus 



fc(( 



),m,k') =h(( • ) ,m- l,fc') ) . 



l,fc') 



(This works with the choice of k' = — 1 when no signal fc' could enable 
the sending of signal k because h (S, m, —1) is zero for all S and m.) 

Together, the two subcases where S changes to S' after i < m steps 
and where S remains unchanged until after m steps clearly count all pos- 
sibilities for the epochs starting with r-twinstack S, taking m steps, and 
then ending by sending signal k. Therefore we get the following recursive 
definition of h (S, m, k) for the case where S is not the empty r-twinstack. 



h(S,m,k)= EE 

i=l j>0 



,%3) 



,k) 



S'^S reachable 
from S with 
signal j 



+ h((» ),m-l,k')+h(( ),m-l,k') 

We can arrive at a single recursive formula for h (S, m, k) in both the 
case where S is and is not the empty r-twinstack by using the indicator 
function > 0} . Then 



h(S,m,k) = l{\S\ >0}EE 

i=l j>0 



E h(S',i 



S reachable 
from S with 
signal j 



+ h(( • ),m-l,k')+h(( 



, m — 1, / 



whenever m is greater than 1. 

Since this recursive formula gives h (S, m, k) in terms of the values of 
the h function for strictly smaller numbers of steps, all that remains is to 
describe how to calculate the base case h (S, 1, k) for all S and k. 



Base Case (m = 1): 

It is immediately clear that h (S, 1,0) is always 1 regardless of the value 
of S. (There is always a unique r-history of length 1 which ends by 
popping, namely the history created by taking the next element of the 
input, popping it, and then popping everything else as well.) 

For the case where k > (where the epoch ends by welding, there is 
always at most one r-history of length 1 which ends by welding k elements, 
because any such history must take the next element i from the input and 
then weld it to the bottom of the r-twinstack. These situation actually 
exactly parallels the second subcase of the recursive case, in which S 
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remains unchanged until after the mth step and we wanted to find a signal 
k' whose reception would allow the sending of signal k. Here, however, 
the signal k' is limited to being 1. Thus we can only send signal k when 
S is one-sided and \S\ = k — 1. 

Therefore, the value of h (S, 1, k) is given succinctly by 



h(S,l,k) 



1 if k — or if S is one-sided and l^l = k 
otherwise 



Together with the previously derived recursive case, this gives a com- 
plete description of h (S, m, k). 

We claim that this recursive description can easily be turned into a 
memoized dynamic algorithm to compute \C n \ = h (( ),n, 0) in 

O (n 5 2 n ) time. However, since this algorithm actually shows up as a 
special case of the version for the deque case, we will wait to describe it 
in the next section. 



8 The General Relativistic Algorithm for 
Counting Sortable Permutations 

We now wish to derive a modified version of the recursive function from 
the previous section which can be used to calculate the number of deque 
sortable length-n permutations. We begin by reviewing the modifications 
that were needed to adapt the Rosenstiehl and Tarjan's algorithm to work 
for deques. Recall that there were two such modifications. 

1. Whenever an element is welded down to become the very bottom 
element of the bottom twinstack, we tuck that element under the 
side stack containing any other elements (instead of leaving it on 
the opposite side and thus creating a double-sided twinstack). 

2. Whenever an operation popping an element element from the bottom 
twinstack causes the deque to become monotonic (so that all the 
elements except possibly the largest one are together in one of the 
side stacks), we tuck the largest element as needed to make the 
twinstack single-sided. 

Clearly, these changes induce changes in the possible transitions for an 
r-state. We can easily visualize the result of these changes. 

However, it is important to note that these changes only affect the 
transitions that involve the actual bottom r-twinstack. Therefore, for 
any higher level epochs, the number of r-histories remains completely 
unchanged. Thus, our subproblem decomposition into epochs will involve 
some subproblems using the new transition rules for their r-twinstacks, 
as well as some subproblems using exactly the same transition rules that 
we considered for the parallel stack case. This motivates the definition 
of a new map h(S,m,k,b) which gives the number of epochs/r- histories 
starting with r-twinstack S, taking m steps and then ending by sending 
signal k, given the transition rules for bottom r-twinstacks if b = 1 and 
given the transition rules for non bottom twinstacks if b = 0. Here, the 
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Welding: 



Popping: 




Figure 9: Transition differences 



new argument, b, is simply a binary flag telling whether the r- histories we 
are counting are at the bottom level or not. 

Note that h (S, m, k, 0) is just our previous map h (S, m, k), so h (( ) , n, 0, 0) 

still counts |C„|. When we consider h (( ) ,n, 0, l), however, we are 

counting the number of bottom level successful r-histories of length n us- 
ing the deque transition rules. Thus/i(( ) ,n,0,l) counts [D n \, and 
an algorithm which can compute h (S, m, k, b) can compute the number 
of sortable permutations of length-n for both parallel stacks and deques. 
(This was our motivation for delaying a complete algorithm description 
in the previous section.) 

Constructing the Recursive Formula for h(S,m,k,b): 

There are two ways in which the recursive formulas for h (S, m, k) needs 
to be modified to give recursive formulas for h(S,m,k,b). We need to 
correctly choose the values of b to be passed to the recursive calls, and we 
need to use the correct transition rules given the passed parameter b. 

The transition rule appears nowhere in the base case, and only in two 
places in the recursive case. When we consider r-histories which start 
with r-twinstack S and then modify their r-twinstack to S" after i steps, 
we summed over all S' S such that S' was reachable from S given signal 
j. For the new formula, the set of states which are reachable depends on 
the value of 6, so we simply change the sum to be over all S" 7^ S such that 
S' is reachable from S given signal j and the transition rules corresponding 
to b. 

The second place that the transition rule appears is in our statement 
that the only two r-twinstack states that can be transitioned to from the 
empty r-twinstack ( ) in one step are the empty r-twinstack, and 

the r-twinstack with one element, ( • ). Clearly, this is still the case 
regardless of whether we are using the transition rules for the bottom 
epoch or not, so the addition of b does not necessitate any change to that 
section of the formula. 

Regarding passing the correct values of b to the recursive calls, we 
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simply need to identify which recursive calls are counting epochs at the 
current level (these are passed the current value of b) and which recursive 
calls are counting epochs at the next level up (these are always passed 
as their last argument). Thus we get the following recursive formula for 
h (S, m, k, b) when m > 1. 



h(S,m,k,b) = t{\S\>0}Y / J2 

i=l j>0 



h(( ),i,j,0) h(S',m-i,k,b) 



S'^S reachable 

from S with 
signal j given b 



+ h((» ),m-l,fc',l{|S|=0})+h(( ),m-l,*',l{|S|=0}) 

The formula for the base cases of h (S, m, k, b) remains unchanged (so 
h(S,l,k,l) = h {S, 1, k,0) = h (S, 1, fe)). 



The Algorithm: 

We are now ready to describe an efficient memoized dynamic program 
algorithm to compute h (S, m, k). 

We will begin by describing the helper function Get-R-Twinstack- 
Transition-List to efficiently compute all the S's we can transition to given 
a specific S, j, and 6. Note that throughout the following implementa- 
tions it will be convenient to always place the smallest element on the left 
stack. 

Consider the following psuedocode for Get-R-Twinstack- Transition- 
List. 
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Procedure Get-R-Twinstack-Transition-List (S, j, b) 

let result be a new list 

let Is be the number of elements in S 

let zs be the index of the first zero element in S (equal to Is if S 
contains no zeros) 

let ms be the maximum size postfix of S which has the property of 
being monotonic 
if j > then 

let smallElements be a new list containing j — 1 ones 
for i = 1 to zs do 

let X be a new copy of S 

insert a into X at index i 

if b —— 1 and i = Is then 



prepend smallElements to X 
add X to result 

else 

for i = to Is do 

let X be a new copy of S 

remove the first i elements from X 

if the first element of x is a then 



add X to result 
_ return result 



Lemma 8.1. Get-R-Twinstack- Transition-List is correct. 

Proof. First consider the case where j is greater than zero. Here, we are 
welding j elements onto the r-twinstack. As discussed previously, exactly 
one of these elements, y, must be larger than at least one element in S. 
The other k—1 element must be smaller than than every element in S and 
on the opposite side of the incoming weld from the element y. Since the 
k—1 smaller elements (if any) will be the smallest elements in each of the 
resulting r-twinstacks, we place them on the left side at the beginning of 
each new r-twinstate, X. We then generate every every new r-twinstack X 
over all possible placements of the element y into the right side of X after 
the first element of S and before the first right-side element of S. This 
process enumerates all of the possible transition states if the r-twinstack 
is not the bottom stack (as indicated by b — 0). 

Alternatively, if b = 1 and the last of the enumerated Xs placed the 
element y as the last element, we move y to the bottom of the left stack 
instead (to preserve one-sidedness of the monotonic state). Thus the re- 
turned list of Xs is exactly the correct set of transition r-twinstacks. 

Now consider the case where j is zero. Here we enumerate the results 
of popping any number of the smallest elements (from none of them to 
all of them). If the set of elements popped would leave the top (smallest) 




switch the value of each element of X 



if b —— 1 and (Is — i) < ms and X is not empty then 
set the last element of X to be a 1 
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element on the right stack, we reverse the resulting r-twinstack X to 
maintain the invariant that the smallest element is always on the left. 
This will correctly give every possible transition r-twinstack when S is 
not the bottom r-twinstack. 

For the case when 6=1 (indicating that S is the bottom r-twinstack), 
we check whether each resulting r-twinstack X is monotonic. If it is, it 
must either have the form of having no right stack elements, or having a 
single largest element in the right stack after some number of left stack 
elements. We simply change the latter case to the former by specifying 
that the last element, whatever it is, must be in the left stack. 

Thus, for each triple of arguments S, j, and b, Get-R-Twinstack- 
Transition-List(S, j, b) returns a list with each r-twinstack S" reachable 
from S upon reception of signal j given b. □ 

Lemma 8.2. Get-R-Twinstack- Transition-List can be implemented to 
run in O (\S\) time. 

Proof. For every conceivably computable value of n, we can choose an 
integer datatype having enough bits to represent each possible twinstack. 
Let the last Is of these hold the bits indicating the position of elements 
in S, and let all the higher order bits be zero. Then we can perform all 
of the required operations of copying S, inserting elements into S, setting 
elements of S, reversing elements of S, and shifting elements of S, in 
constant time on standard architectures using bit-shift and bitwise logic 
operations. 

Therefore, the only parts of this algorithm contributing a non-constant 
amount to the runtime are the computations of the variables Is, zs, and 
ms, and the two loops. Clearly, each of Is, zs, and ms can be computed 
in 0(151) time. Additionally, since the loops loop over zs and Is + 1 
indices respectively, and zs and Is are each bounded above by \S\, the 
loops also contribute no more than O (\S\) time. 

Therefore Get-R- Twinstack- Transition-List can be implemented to run 
in 0(151) time. □ 

Now consider the psuedocode for the full algorithm. 
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Procedure Relativistic-Histories (S, m, k, b) 
global dictionary 

if dictionary .hasKey((S,m,k,b)) then 
[_ return dictionary .getV al((S,m,k,b)) 

if m == 1 then 

/* the base case 

if k == or (\S\ == (k — 1) and S is one-sided) then 
| result — 1 

else 

result — 

dictionary. add((S, m, k, b), result) 
return result 



*/ 



else 



/* the recursive case 
if \S\ == then 

result — 

Relativistic-Histories(( • 
Relativistic-Histories ( ( 

else 

result — 

k' = Get-KPrime(5, fc) 
if not k' == —1 then 

result+ = 

Relativistic-Histories ( 
Relativistic-Histories ( 



*/ 



, m 
, m - 



■ l,k,b) + 
l,fe,W 



, m 
, m - 



- l,fc',0) + 
l,fe',0) 



for i = 1 to m — 1 do 
for j = to i + 1 do 

let transitionList = 

Get-R-Twinstack-Transition-List (5*, j, b) 
for 5*' in transitionList do 
if not 5' == S then 

result+ = 

Relativistic-Histories ( ( ) , i, j, 0) • 

Relativistic-Histories (5", m — i,k, b) 



dictionary. add((S, m, k, b), result) 
return result 



Where Get-KPrime(5', k) is just the previously described map 
k'(S,k) 



if k = 

\S\ — k if S is one-sided and \S\ < k 
— 1 otherwise 
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Procedure Relativistic-Sortable-Count (n, 6) 
if b —— 1 then 

/* the deque case */ 
return Relativistic-Histories ( ( ) , n, 0, 1) 

else 

/* the parallel stack case */ 
return Relativistic-Histories (( ) , n, 0, 0) 



Theorem 8.3. Relativistic-Sortable-Count is correct. That is, when 
called with b = 1, it returns \T> n \, and when called with with b = it 
returns \C„\. 

Proof. Rather than offer a lengthy proof here, we simply refer the reader 
to the reasoning in the above sections to see that the correct recursive 
formula for h (S, m, k, b) is indeed 



h(S,m,k,b) = l{\S\ >0} 

i=l j>0 



h(( ),i,j,0) h(S',m~i,k,b) 

S'^S reachable 

from S with 
signal j given fa 

+ h((» ),m-l,k',l{\S\=0})+h(( ),m-l,fc',l{|S|=0}) 



when m > 1 and 



h(S,l,k,b) 



1 if k — or if S is one-sided and |S| 
otherwise 



when m = 1. Clearly, Relativistic-Histories implements this recursive 
formula to return h (S, m, k, b). 

But we also know from Lemma 7.2 that the number of length-n per- 
mutations sortable on a deque is equal to the number of root r-histories 
(or epochs) of length n. Since a root r-history can only end by sending 
signal (since there is no r-twinstack below it which it can weld to), this 
latter quantity is exactly ),n,0,l), which is the value returned 

by Relativistic-Sortable-Count when 6=1. Thus Relativistic-Sortable- 
Count(n, 1) correctly returns \T> n \. 

Similarly, when Lemma 7.2 also implies that the number of length-n 
permutations sortable on a pair of parallel stacks is equal to the num- 
ber of root r-histories of length n using the transition rules for parallel 
stacks. Once again, a root r-history can only end by sending signal zero. 
Thus this count is exactly h ( ( ) , n, 0, 0) , which is the value returned 

by Relativistic-Sortable-Count when 6 = 0. Thus Relativistic-Sortable- 
Count(n, 0) correctly returns \C„\. □ 

Theorem 8.4. Relativistic-Sortable-Count has time complexity O (n 2 n ) 
and space complexity O (n 2 2 n ) . 



37 



Proof. First consider the runtime of Relativistic-Sortable-Count. The 
Relativistic-Sortable-Count subroutine clearly only contributes constant 
runtime, so any non-constant factors must come from calls to Relativistic- 
Histories. Thus we need to determine the max runtime of any given call to 
Relativistic-Histories, along with the number of such calls that are being 
made. If Relativistic-Histories finds that the desired value has already 
been computed and stored in the memoization dictionary, then its run- 
time is (ostensibly) constant. Otherwise, if m = 1 then its runtime is still 
constant. Finally, if it needs to compute the value for m > 1, then it 
does so in a nested for loop where the two outer loops can iterate order 
m = O (n) times. Inside these two loops is the call to Get-R-Twinstack- 
Transition-List (with an associated runtime of O (n)) and the third loop 
which iterates over the return list (again O (n)). Thus, in the worst case, 
Relativistic-Histories histories takes O (n 3 )time to run. 

When we consider the number of calls made to Relativistic-Histories, 
we only need to consider calls made where the desired value has not yet 
been computed. (This is because whenever we have memoized the value 
for a certain set of args, the runtime of Relativistic-Histories is constant 
and is therefore taken care of by the computation for the runtime of the 
caller.) The number of such calls to Relativistic-Histories is limited by the 
size of the domain of the function h (S, m, k, b). Since S can range over 
binary strings of length n, m and k are both order n, and b has only two 
values. The size of this domain is O (n 2 2 n ). Therefore, the total runtime 
of all calls to Relativistic-Sortable-Count has time complexity O (n 5 2™). 

The space complexity of Relativistic-Sortable-Count is just the size of 
the memoization dictionary, which is limited by the size of the domain 
of the map h (S, m, k, b). Therefore Relativistic-Sortable-Count has space 
complexity O (n 2 2 n ). □ 

9 Results Obtained with the Relativistic 
Algorithm 

As described in section 6, our most efficient implementation of the old 
approach for computing \T> n \ and \C„\ using tree search was only successful 
for up to n — 14. In fact, that implementation was written in C with 
careful consideration to factors like avoiding memory allocation, and it 
still had to be run overnight in order to compute the results for n = 14. 

Our first (and so far our only) implementation of the relativistic al- 
gorithm is in python using the built in types, with no special emphasis 
on efficiency. Such an implementation can be many orders of magnitude 
slower than a good C implementation. (For example, we took same ap- 
proach of first implementing the tree search algorithm in Python before 
coding it in C, and that implementation was limited to n — 10.) Nev- 
ertheless, because of the greatly improved asymptotic runtime our new 
Relativistic-Sortable-Count algorithm, we were able to compute all of the 
values of \T> n \ for up to n = 21 in under twelve minutes. Similarly, we 
computed \C n \ for up to n = 22 in under twenty-two minutes. We have 
included these table of numbers as appendix A and B respectively. 
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While the new relativistic algorithm is much faster than the previous 
best approach, this speed does come with a price. Relativistic-Sortable- 
Count has a space complexity of 0(n 2 2 n ), as compared to the linear 
space complexity of the tree search algorithm. (Note that this is still an 
improvement over the space complexity of Zimmermann's algorithm which 
had an exponential term whose base was greater than the growth rate of 
the permutation class.) Because of this, the algorithm failed to compute 
\T>22\ or IC23I on the linux machines on which I was running it, presumably 
because the python dictionary tried to grow to well over fifteen million 
elements which lead to thrashing. 

The simplest possible approaches to this problem would a more effi- 
cient custom hash storage solution, or even just running the algorithm on 
a machine with more memory. These could probably be used to acquire a 
few more terms of the sequence. A better long term approach (suggested 
by Peter Doyle) would be to develop a better understanding of the depen- 
dencies among the values of the map h (S, m, k, b). This could be used to 
try to redesign the algorithm to use an access order that is less affected 
by paging to disk. We leave such changes for future work. 

We should note before moving on, however, that fifteen million is much 
less than (22 2 • 2 22 ). Thus the domain of the map is only being sparsely 
populated, and the O (n 2 2 n ) space complexity (and O (n 5 2 n ) time com- 
plexity) limit may be quite conservative. 

10 Some Observations On Deque-Sortability 
Given Imperfect Information 

The problem that initially caused us to start looking at the class of per- 
mutations which were sortable on a deque (T>), was Peter Doyle's proposal 
of a game he called Double- Ended Knuth (or DEK for short). To borrow 
Doyle's description: 

DEK is a bare-bones relative of familiar solitaire games like 
Klondike. In DEK, we use a one-suit deck consisting of only 
the thirteen hearts (say). We shuffle the deck thoroughly, and 
place the deck face down on the table. The goal is to end with 
the cards in a pile face up, running in order from ace to king. In 
addition to the deck and the pile (initially empty), we maintain 
a line of cards (initially empty), called the deque, spread out 
face up on the board. At any point, if the next card needed 
for the pile is available as the top card of the deck or at either 
end of the deque, we may move it up to the pile; otherwise, our 
only option is to move the top card of the deck to either end 
of the deque. [3] 

Thus, DEK is the problem of sorting a permutation of length 13 on 
a deque given imperfect information. If the cards forming the input per- 
mutation were visible face up, then one could simply run our corrected 
version of the Rosenstiehl-Tarjan algorithm to determine whether or not 
it was sortable and if so how to sort it. Instead, however, we are forced to 
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make he decision of which end of the deque to add a the top card of the 
deck to immediately after having revealed that card and before viewing 
any of the other cards remaining in the deck. 

The vast majority of our work has been spend investigating the omni- 
scient case. Nevertheless, we offer a few remarks about this problem. 

Theorem 10.1. The distinction between sorting with complete informa- 
tion and sorting with incomplete information is important. That is, one 
cannot choose a strategy for the incomplete information case which will 
succeed on all sortable inputs. 

Proof. Consider the pair of permutations it = 7526431 and o — 7524163. 
After revealing the first three elements of these permutations and adding 
them to the deque, there are (up to reflection) two possible states for the 
deque, namely 

a) 257 and b) 572 

Clearly, state a) can be used to sort permutation n (by adding the 
6, 4, 3 and then the 1 to the right side of the deque and then popping 
everything). If however, the remainder of the permutation happens to be 
a, then the sorting attempt will fail since the 4 will be forced to be placed 
to the right of the 7 and then the sequence 574 will still be on the deque 
when the 6 must be placed. 

Alternatively, state b) can be used to sort permutation a. This can 
be done by adding the 4 to the left end next to the 5, and then adding 
the 1 and sending both the 1 and the 2 to the output. The state of the 
deque is then 457, and the 6 and then the three can be added to the right 
side before popping all of the elements to the output. The state b) fails, 
however, to sort 7r since the very next element, the 6, cannot be placed 
without sandwiching either the 5 or the 2. 

Therefore, even though any permutation in the set {ir, o} could be 
sorted on a deque given complete information, it is possible that in trying 
to sort a permutation from this set with incomplete information we could 
fail because we are forced to make a choice about the placement of the 
third element, and either choice will preclude the possibility of sorting one 
of the permutations in this set. □ 

The problem when sorting given incomplete information, as illustrated 
in the above theorem, is that we must sometimes make a choice between 
either of two possibilities for the placement of an incoming element such 
that either choice will rule out the possibility of sorting some subset of 
the permutations in V. One wonders, therefore, what are the necessary 
conditions for a choice that can affect the sorting success. 

Theorem 10.2. In order for the player of a game of DEK to come across 
a choice which could affect their scoring success, the following conditions 
are necessary and sufficient. 

1. The deque must already contain two distinct elements. (Let i denote 
the smaller of these, and let j denote the larger.) 

2. The incoming element must be smaller than both end elements of 
the deque. 
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3. There must be a gap of at least two elements between the value of 
the incoming element and the smaller of the two end elements of the 
deque. 

4. The incoming element must not be the next element required by the 
output. 

5. There must be an element larger than i which is still in the input. 

6. If the deques state is non-monotonic, then there must be an element 
larger than i but smaller than j which is still in the input. 

Proof. (Necessity): Clearly the deque state is equivalent (up to symmetry) 
regardless of the player choice if the deque contains one or fewer elements. 
Therefore, condition 1 is a necessary condition for being faced with a 
substantive choice. 

Now suppose that the incoming element is larger than the smaller of 
the two distinct end element of the deque. Clearly this incoming element 
cannot be placed next to the smaller of the existing end elements, or 
that smaller end element would become sandwiched and the game would 
certainly be lost. Therefore, if there are two distinct end elements on the 
deque and the incoming element is larger than either of them, the players 
move is forced and no substantive choice exists. 

Next, for condition 3, suppose that the smaller of the existing end 
elements of the deque is i and that the incoming element is x = (i — 1) or 
x = (i — 2) (so that no two element gap exists between them). Then we 
claim that it is always a safe play to place the new element next to i. 

Suppose that the permutation is sortable by placing x next to the the 
other, larger end element, j. Then, by placing x next to j to get the state 
iCjx (we assume wlog that i is the left end element), and then choosing 
future choices correctly, one will eventually arrive at a point at which x 
can be moved to the output. Consider the state immediately after this 
move. No element can be to the right of j on the deque. No element less 
than x can be on the deque. No element greater than i can be on the 
deque, since the placement of such an element prior to the removal of x 
would have pinned either i or x. Therefore, the state of the deque must 
be either 

iCj or (i — 1) iCj possible in the case where x = (i — 2) 

In the former case, the only elements appearing after the point where x 
appeared and before the point where x was popped were elements strictly 
less than x. Therefore we could just as easily have placed x next to the 
end element i. 

In the latter case, at the point when (i — 1) arrived, the only other 
elements which had already arrived but had not already moved to the 
output must lie in a sequence S such that the deque state at the moment 
of (i — l)'s arrival was iCjxS, where S is a sequence which is decreasing 
from left to right. None of the elements arriving between x and (i — 1) 
were larger than x, however, so by placing x next to the end element i and 
then inverting the placements of every element following x and preceding 
(i— 1), we could have the state SxiCj at the time of (i — l)'s arrival. 
By then placing (i — 1) on the right end, and continuing to invert the 
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placement of every element received between (i — 1) and the movement of 
x to the output pile, we see that it must also be safe to place x next to i. 
Thus no substantive choice is required if condition 3) does not hold 

Clearly, if x is the next element required by the output then doesn't 
matter where we place it since we can immediately get rid of it. 

For condition 5, suppose that every element larger than i has already 
been moved out of the input. Then all such elements must already be in 
the sequence Cj, and since we are assuming that we don't start with a 
sandwich state (in which case clearly no substantive choice can exist) the 
elements n through i must be ordered such that they are in decending 
order starting from the element n and reading either right or left. 

Therefore, the elements remaining in the input for any sortable per- 
mutation must all be moved to the ends of the deque and thence to the 
output before i or any other element of iCj is moved. This implies that 
whatever remains on the input is a parallel stack sortable permutation 
which can be sorted on the two parallel stacks radiating to the left and to 
the right of iCj, and clearly the choice of which of the two parallel stacks 
to add the first element to is arbitrary. 

Finally suppose that the deque is non-monotonic and that no element 
between the values of i and j is on the input at the time that x arrives. 
Then for any sortable permutation, every element of the input which is 
larger than i must wait to arrive until after i has been moved to the 
output. Thus every such element must follow every element smaller than 
i in the input. Therefore, the input has as a prefix some parallel stack 
sortable permutation consisting of all elements less than i and not yet 
in the output, and so once again the placement of the element x < i is 
unimportant. 

Thus all six conditions are necessary for the player to be presented 
with a substantive choice. 

(Sufficiency): Suppose that all six conditions are met. 

Subclaim. There is a permutation that is sortable only if x is placed 
next to the smaller end element, i. 

Proof. Suppose first that the deque is monotonic. Let z be some element 
which is larger than i and is still in the input. Then the permutation 
in which the input consists of xz followed by every remaining element in 
sorted order is clearly sortable if x is placed next to i but not if it is placed 
next to j . 

Now suppose that the deque is non-monotonic. Then condition 6 guar- 
antees that there is some element z which is in the input and has value 
between i and j. The same permutation is thus sortable if x is placed 
next to i but not if it is placed next to j. □ 

Subclaim. There is a permutation that is sortable only if y is placed 
next to the larger end element, j. 

Proof. Suppose first that the deque is monotonic. Consider the permuta- 
tion where x is followed by (i — 1), then by every element less than x not 
already in the output in sorted order, then by some z > i, and then by 
every remaining input element in sorted order. If x is placed next to j, 
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then (i — 1) can be placed next to i, and it is clear that the remainder of 
the permutation can be successfully sorted from this state. If, however, x 
is placed next to i, then the (i — 1) must be placed adjacent to j. After 
the sequence of elements less than x arrives and departs, the deque will 
still have a non-monotonic state with end elements i and (i — 1). Thus 
the element z will necessarily cause a sandwich. 

Now suppose that the deque is non-monotonic. Let z be the element 
whose existence is guaranteed by condition 6. Consider the same input 
permutation described above. Clearly this is still sortable given the place- 
ment of x adjacent to j. Once again, however, if x is placed adjacent to 
i, the element (i — 1) must go adjacent to j, and we end up with a non- 
monotonic stack with end elements i and [i — l)whcn z arrives. □ 

By the above two subclaims, whenever all six conditions are met, the 
choice presented to the player is substantive. Therefore these six condi- 
tions are both necessary and sufficient. □ 

Having determined when choices matter, we want to understand how 
to make the right choice. One obvious strategy for cases where we have 
sufficient computational power is: 

Strategy 1. Enumerate all possible remaining inputs, and make the 
choice that leaves more of these winnable. 

After identifying this strategy, however, we realized that it amounts 
to choosing based on which placement gives the player the most winnable 
scenarios given omniscient information in the future. Thus this is the 
optimal strategy for a modified version of DEK where the player plays 
till their first substantive choice, makes that choice, and then reveals the 
remainder of the input deck and trys to play on with complete information. 

The actual optimal strategy of DEK play is this one. 

Strategy 2. (optimal) Use a choice criteria C such which will lead to the 
most winnable scenarios when applied to this and all future choices. 

A priori, it seems possible that the scenarios which are winnable from 
one choice are more or less evenly split beneath a future choice, whereas 
the scenarios winnable from the alternative choice are not so limited by 
future choices. Thus we might imagine that these two strategies could 
disagree. In order to try to find an example where the disagreed, I wrote 
a persistent version of Rosenstiehl-Tarjan-Modified and then used this to 
calculate the decision of each strategy at each substantive choice encoun- 
tered in a search of the permutation tree. 

Surprisingly, for the small cases I tested (up to n = 12), we did not 
find any example where the selections made by Strategies 1 and 2 differ. 

11 Conclusions and Acknowledgements 

To sum up the main results of this work: We examined the deque sortabil- 
ity testing algorithm presented thirty years ago by Rosenstiehl and Tarjan, 
and identified an error in this algorithm. To the best of our knowledge, 
this flaw was previously unknown. (We have examined works which cite 
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Rosenstiehl and Tarjan's algorithm, and none of them address this issue.) 
Sadly, we have been unsuccessful in our attempts to contact the authors 
directly. After identifying the flaw in the Rosenstiehl and Tarjan's algo- 
rithm, we proposed a solution and then offered a proof that the modified 
version of the algorithm is indeed correct. 

We then developed a new algorithm for computing the number of per- 
mutations of a given size n which are sortable on either a pair of parallel 
stack or on a deque, which has a greatly improved asymptotic runtime 
when compared with the previous best approach to making these calcu- 
lations. Using our new algorithm we calculated the number of sortable 
permutations for several lengths beyond what was previously known. 

Finally, we have presented a description of exactly when, in attempt- 
ing to sort a permutation given incomplete information, one must make a 
choice which effects the set of permutations for which the sorting compu- 
tation being attempted can succeed. 

I would like to thank both of my advisors on this project: Scot Drys- 
dale, who has always been very supportive has given excellent feedback in 
putting together this project, and Peter Doyle, without whom this work 
and my time at Dartmouth in general would have been greatly impover- 
ished. 

I would also like to thank Professor Prasad Jayanti for kindly serving 
on my thesis committee. 

Finally, I want to thank Sergi Elizalde who taught my combinatorics 
courses addressing permutations and permutation patterns, and who pro- 
vided great expertise that I should have taken advantage of sooner. 

Appendix A 

This table lists our computed values of \T> n \ for n = 1, . . . , 21. The first 
fourteen terms of this sequence appear in the online encyclopedia of integer 
sequences as sequence A182216. 



1 51069582 

2 365879686 
6 2654987356 
24 19473381290 
116 144138193538 
634 1075285161294 
3762 8076634643892 
23638 61028985689976 
154816 463596673890280 
1046010 3538275218777642 
7239440 
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Appendix B 



This table lists our computed values of \C n \ for n = 1, . . . , 22. 



1 


24180340 


2 


161082639 


6 


1091681427 


23 


7508269793 


103 


52302594344 


513 


368422746908 


2760 


2620789110712 


15741 


18806093326963 


93944 


136000505625886 


581303 


990406677136685 


3704045 


7258100272108212 
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